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Abstract 

Kucera and Gacs independently showed that every infinite sequence is Turing 
reducible to a Martin-Lof random sequence. This result is extended by showing 
that every infinite sequence S is Turing reducible to a Martin-Lof random sequence 
R such that the asymptotic number of bits of R needed to compute n bits of S, 
divided by n, is precisely the constructive dimension of S. It is shown that this is 
the optimal ratio of query bits to computed bits achievable with Turing reductions. 
As an application of this result, a new characterization of constructive dimension is 
given in terms of Turing reduction compression ratios. 
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1 Introduction 

An (infinite, binary) sequence S is Turing reducible to a sequence R, written S <t R, if 
there is an algorithm M that can compute S, given oracle access to R. Any computable 
sequence is trivially Turing reducible to any other sequence. Thus, if S <t R, then 
intuitively we can consider R to contain the uncomputable information that M needs to 
compute S. 

Informally, a sequence is Martin-Lof random [Mar66' if it has no structure that can 
be detected by any algorithm. Kucera |Kuc85t lKuc89ij and Gacs |Gac86j independently 



obtained the surprising result that every sequence is Turing reducible to a Martin-Lof 
random sequence. Thus, it is possible to store information about an arbitrary sequence 
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S into another sequence R, while ensuring that the storage of this information imparts 
no detectable structure on R. In the words of Gacs, "it permits us to view even very 
pathological sequences as the result of the combination of two relatively well-understood 
processes: the completely chaotic outcome of coin-tossing, and a transducer algorithm." 
Merkle and Mihailovic jMM04j have provided a simpler proof of this result using martin- 
gales, which are strategies for gambling on successive bits of a sequence. 

Bennett jBenSSj claims that "This is the infinite analog of the far more obvious fact 
that every finite string is computable from an algorithmically random string (e.g., its 
minimal program)." However, the analogy is incomplete. Not only is every string s 
computable from a random string r, but r is an optimally compact representation of s. 
Viewing the sequence i? as a compressed representation of the sequence S, the asymptotic 
number of bits of R needed to compute n bits of S, divided by n, defines the compression 
ratio between them. Gacs showed that his reduction achieves a compression ratio of 1: 
for any n, n + o{n) bits of R are required to compute n bits of S. But as in the case of 
strings, sequences that are sparse in information content should in principle be derivable 
from a more compact description. 

Lutz |Lut03bj defined the (constructive) dimension dim(S') of a sequence 5* as an 
effective version of Hausdorff dimension (the most widely-used fractal dimension; see 
[Haul 91 EEO])- Construct ive dimension is a measure of the "density of computably 
enumerable information" in a sequence. Lutz defined dimension in terms of constructive 
gales, a generalization of martingales. Mayordomo |May02| proved that for all sequences 
5, dim(5') = liminf„^oo ^^'^^"^ , where K(5' \ n) is the Kolmogorov complexity of the n^^ 
prefix of S. 

Athreya et al. |AHLM06] , also using gales, defined the ( constructive ) strong dimension 
Dim(S') of a sequence S as an effective version of packing dimension (see jTri82| ISul841 
IFal90j ) ■ another type of fractal dimension and a dual of Hausdorff dimension. They 
proved the analogous characterization Dim(S') = lim sup^^g^ k(5 r») _ gj^ce Kolmogorov 
complexity is a lower bound on the algorithmic compression of a finite string, dim(S') and 
Dim(S') can respectively be considered to measure the best- and worst-case compression 
ratios achievable on finite prefixes of S. 

Consider the following example. It is well known that K, the characteristic sequence 
of the halting language, has dimension and strong dimension BarGB- The binary 
representation of Chaitin's halting probability Vt = YIm halts 2^'^^' (where M ranges over 
all halting programs and \M\ is M's description length) is an algorithmically random 
sequence |Cha75j . It is known that K <t (see |LV97j ). Furthermore, only the first n 
bits of Q are required to compute the first 2" bits of K, so the asymptotic compression 
ratio of this reduction is 0. f2 can be considered an optimally compressed representation 
of K, and it is no coincidence that the compression ratio of achieved by the reduction 
is precisely the dimension of K. 

We generalize this phenomenon to arbitrary sequences, extending the result of Kucera 
and Gacs by pushing the compression ratio of the reduction down to its optimal lower 
bound. Thus, this paper completes Bennett's above-mentioned analogy between reduc- 
tions to random sequences and reductions to random strings. Compression can be mea- 
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sured by considering both the best- and worst-case hmits of compression, corresponding 
respectively to measuring the hmit inferior and the hmit superior of the compression ra- 
tio on longer and longer prefixes of S. We show that, for every sequence S, there is a 
sequence R such that S <t R, where the best-case compression ratio of the reduction 
is the dimension of S, and the worst-case compression ratio is the strong dimension of 
S. Furthermore, we show that the sequence R can be chosen to be Martin-Lof random, 
although the randomness of R is easily obtained by invoking the construction of Gacs 
in a black-box fashion. The condition that R is random is introduced chiefly to show 
that our main result is a strictly stronger statement than the result of Kucera and Gacs, 
but the compression is the primary result. Finally, a single machine works in all cases; 
as is the case with Kolmogorov complexity, a single Turing reduction reproduces each 
sequence S from its shortest description. Our result also extends a compression result of 
Ryabko |Rya84 Rya86| , discussed in section |21 although it is not a strict improvement, 
since Ryabko considered two-way reductions (Turing equivalence) rather than one-way 
reductions. 

One application of this result is a new characterization of constructive dimension as 
the optimal compression ratio achievable on a sequence with Turing reductions. This 
compression characterization differs from Mayordomo's Kolmogorov complexity charac- 
terization in that the compressed version of a prefix of S does not change drastically 
from one prefix to the next, as it would in the case of Kolmogorov complexity. While 
the theory of Kolmogorov complexity assigns to each finite string an optimally compact 
representation of that string - its shortest program - this does not easily allow us to 
compactly represent an infinite sequence with another infinite sequence. This contrasts, 
for example, the notions of finite-state compression |Huf59j or Lempel-Ziv compression 
|ZL78j . which are monotonic: for all strings x and y, x ^ y [x is a. prefix of y) implies 
that C{x) C C{y), where C{x) is the compressed version of x. Monotonicity enables these 
compression algorithms to encode and decode an infinite sequence - or in the real world, 
a data stream of unknown length - online, without needing to reach the end of the data 
before starting. However, if we let 7r(x) and T^ly) respectively be shortest programs for x 
and y, then x ^ y does not imply that 7t{x) C vr(?/). In fact, it may be the case that 7t{x) 
is longer than 7r(y), or that 7r(a;) and Tc{y) do not even share any prefixes in common. 
In the self-delimiting formulation of Kolmogorov complexity, tt{x) cannot be a prefix of 

Our characterization of sequence compression via Turing reductions, coupled with the 
fact that the optimal compression ratio is always achievable by a single oracle sequence 
and reduction machine, gives a way to associate with each sequence S another sequence 
R that is an optimally compressed representation of S. As in the case of Kolmogorov 
complexity, the compression direction is in general uncomputable; it is not always the 
case that R <t S. 
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2 Preliminaries 



2.1 Notation 

All logarithms are base 2. We write M, Q, Z, and N for the set of all reals, rationals, 
integers, and non-negative integers, respectively. For ACM, A'^ denotes A fl (0, oo). 

{0, 1}* is the set of all finite, binary strings. The length of a string x G {0, 1}* 
is denoted by A denotes the empty string. Let ao,ai,a2, ■ ■ ■ G {0,1}* denote the 
standard enumeration of binary strings ctq = A, cti = 0, a2 = 1, as = 00, . . .. For k E N, 
{0, l}'^ denotes the set of all strings x G {0, 1}* such that = k. The Cantor space 
C = {0, 1}°° is the set of all infinite, binary sequences. For x G {0, 1}* and y G {0, 1}*UC, 
xy denotes the concatenation of x and y, and x ^ y denotes that x is a prefix of y; i.e., 
there exists u G {0, 1}* U C such that xu = y. For S G {0, 1}* U C and i,j G N, we write 
S[i] to denote the i*'^ bit of 5", with S[0] being the leftmost bit, we write S[i . . j] to denote 
the substring consisting of the i^^ through j^^ bits of S (inclusive), with S[i . .j] = X if 
i > j, and we write f z to denote S[0 . .i — 1]. 

2.2 Kolmogorov Complexity 

We work with the self-delimiting Kolmogorov complexity. See |LV97j for an account of 
this model. All Turing machines are self-delimiting. This means that 

• a Turing machine M is allowed to move its input tape head only to the right, and 

• if M does not halt with its tape head on the rightmost bit of its input, the compu- 
tation is considered invalid. 

Fix a self-delimiting universal Turing machine U. Let x G {0, 1}*. The Kolmogorov 
complexity of x is 

K{x) = min { |7r| | U{n) = x} . 

7rG{0,l}* 

For all g G Q, let K(g) = K(s(g)), where s{q) G {0, 1}* is some standard binary represen- 
tation of the rational q with a numerator, denominator, and sign bit. 

For all w G {0,1}*, let eo{w) = O^^^lw. Define the self-delimiting encoding function 
enc : {0, 1}* ^ {0, 1}* for all w G {0, 1}* by 

enc{w) = Co {a\^\) w. 

For all n G N, let enc(n) = enc(cr„). 

Strings encoded by enc and valid programs for U are self- delimiting. They can be 
prepended to arbitrary strings and uniquely decoded. 

Observation 2.1. For all x G {0,1}*, |enc(2:)| < |x| + 21og|x| + 3, and for all n G M, 
enc(n) < log n + 2 log log n + 3. 
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Our results, being asymptotic in nature, do not depend crucially on using the self- 
delimiting Kolmogorov complexity K; it is simply more convenient for encoding purposes. 
All results would work out the same if we instead use the plain Kolmogorov complexity 
C : {0,1}* — > N (see |LV97j ). Whenever we would need to add a program to a string and 
retain the ability to uniquely decode it, we could simply encode the program using the 
function enc. 

2.3 Reductions and Compression 

Let M be a Turing machine and 5" G C. We say M computes S if, on input n G N, M 
outputs the string S \ n. 

We define an oracle Turing machine (OTM) to be a Turing machine M that can make 
constant-time queries to an oracle sequence, and we let OTM denote the set of all oracle 
Turing machines. For i? G C, we say M operates with oracle R if, whenever M makes a 
query to index n G N, the bit R[n\ is returned. 

Let S,R E C and M G OTM. We say S is Turing reducible to R via M, and we write 
S <T R via M, if M computes S with oracle R. In this case, define M{R) = S. We say 
S is Turing reducible to R, and we write S <t R, if there exists M G OTM such that 
S <T R via M. 

Since we do not consider space or time bounds with Turing reductions, we may assume 
without loss of generality that an oracle Turing machine queries each bit of the oracle 
sequence at most once, caching the bit for potential future queries. 

Let S,P,R e C and M|, G OTM such that S <t P via and P <t R via 
Mp. Define the composition of Mg with Mp, denoted o Mp, to be the oracle Turing 
machine that works as follows. On input n G N and with oracle R, Mg o Mp simulates 
Mg to compute S \ n. Whenever a bit of P is queried by Mg , Mg o Mp simulates Mp 
with oracle R for the minimum number of steps needed to compute that bit of P. 

Observation 2.2. <t is transitive: if S <t P via Mg and P <t R via Mp, then S <t R 
via M| o 

In order to view Turing reductions as decompression algorithms, we must define how 
to measure the amount of compression achieved. Let S, R & C and M G OTM such that 
S <T R via M. Define i^g{M, n) to be the query usage of M on S \ n with oracle R, the 
number of bits of R queried by M when computing S \ n} Define 

Pm{S,R) = hmmf , 

n^oo fl 

pUS,R) = hmsup^t^^. 

n— >oo ri 

Pm{S, R) and Pm{S, R) are respectively the best- and worst-case compression ratios as M 
decompresses R into S. Note that < Pm{S, R) < Pm{S, R) < oo. Let 5* G C. The lower 

^If we instead defined #g (Af, n) to be the index of the rightmost queried bit (i.e., assuming that if a 
bit is queried, all bits to the left of it are also queried) , all results of the present paper would still hold. 
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and upper compression ratios of S are respectively defined 

p-{S) = mill {pJj{S, R)\ S <tR via M} , 

AfeOTM 

p+{S) = mill {pIj{S, R)\ S <tR via M} . 
mgotm 

Note that < p^{S) < p^{S) < 1. As we will see, by Lemma 14.21 and Theorem 14.31 the 
two minima above exist. In fact, there is a single OTM M that achieves the minimum 
compression ratio in each case. 

2.4 Constructive Dimension 

See |Lut03at ILut03b| IAHLM061 ILutOSj for a more comprehensive account of the theory 
of constructive dimension and other effective dimensions. 

1. An s-gale is a function d : {0, 1}* — > [0, oo) such that, for all w G {0, 1}*, 

d{w) = 2-'[d{w0) + d{wl)]. 

2. A martingale is a 1-gale. 

Intuitively, a martingale is a strategy for gambling in the following game. The gambler 
starts with some initial amount of capital (money) d{X), and it reads an infinite sequence 
S of bits. d{w) represents the capital the gambler has after reading the prefix w ^ S. 
Based on w, the gambler bets some fraction of its capital that the next bit will be and the 
remainder of its capital that the next bit will be 1. The capital bet on the bit that appears 
next is doubled, and the remaining capital is lost. The condition d{w) = 
ensures fairness: the martingale's expected capital after seeing the next bit, given that it 
has already seen the string w, is equal to its current capital. The fairness condition and 
an easy induction lead to the following observation. 

Observation 2.3. Let /c G N and let d : {0, 1}* [0, oo) be a martingale. Then 

J2 d{u)=2^d{X). 

An s-gale is a martingale in which the capital bet on the bit that occurred is multiplied 
by 2'^, as opposed to simply 2, after each bit. The parameter s may be regarded as the 
unfairness of the betting environment] the lower the value of s, the faster money is taken 
away from the gambler. Let d : {0,1}* [0, cxo) be a martingale and let s G [0,cx)). 
Define the s-gale induced by d, denoted d^'^\ for all w G {0, 1}* by 

S'\nj) = 2^'-'^\'"\d{w). 

If a gambler's martingale is given by d, then, for all s G [0, oo), its s-gale is d^^\ 

The following theorem, due to Lutz, establishes an upper bound on the number of 
strings on which an s-gale can perform well. 
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Theorem 2.4. jLut03aj Let d he an s-gale. Then for all w G {0, 1}*, A; G N, and a G M+, 

there are fewer than ^ strings u G {0, l}'^ for which 

Corollary 2.5. Let d he a martingale. Then for all I & M., w E {0, 1}*, A; G N, and 
a G M^, i/iere are fewer than ^ strings u E {0, l}'^ /or which 

d{wu) > a2^-U{w). 

Let S" G C, s G [0, oo), and let d : {0, 1}* [0, oo) be an s-gale. d succeeds on S, and 
we write S G S°^[d], if 

limsup(i(S' \ n) = oo. 

n—*oo 

d strongly succeeds on S, and we write S G S^Jc?], if 

liminf d{S \ n) = oo. 

Jl— >oo 

An s-gale succeeds on S if, for every amount of capital C G M"*", it eventually makes 
capital at least C. An s-gale strongly succeeds on S if, for every amount of capital C, it 
eventually makes capital at least C and stays above C forever. 

Let d : {0,1}* [0, C)o) be an s-gale. We say that d is constructive (a.k.a. lower 
semicomputahle, suhcomputahle) if there is a computable function d : {0,1}* x N ^ Q 
such that, for all w G {0, 1}* and t G N, 

1. d{w, t) < d{w, t + 1) < d{w), and 

2. lim d{w, t) = d{w). 

t—*oo 

Let G C. We say that R is Martin- L of random, and we write R G RAND, if 
there is no constructive martingale d such that R G S°°[(i]. This definition of Martin-Lof 
randomness, due to Schnorr [ SchTlj . is equivalent to Martin-Lof 's traditional definition 
fsee |Mar66[lLV97l ). 

The following well-known theorem (see jMM04j ) says that there is a single constructive 
martingale that strongly succeeds on every S ^ RAND. 

Theorem 2.6. |MM04j There is a constructive martingale d such that S^j,[d] = RAND*^. 

Let d : {0, 1}* x N — ^> Q be the computable function testifying that d is constructive. 
The following theorem, due independently to Hitchcock and Fenner, states that d^*) 
is "optimal" for the class of constructive t-gales whenever s > t. 

Theorem 2.7. |Hit03t lFen02j Let s > f G M"*", and let d he a constructive t-gale. Then 

S°°[rf] C S°°[d(*)] and SZ[d] C SZ[d^'^]. 
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By Theorem 1213 the following definition of constructive dimension is equivalent to the 
definitions given in |Lutn8b[ lAHLMOH] . Let X C C. The constructive dimension and the 
constructive strong dimension of X are respectively defined 

cdim(X) = inf{s e[0,oo)\X C S°°[S'^]}, 
cDim(X) = inf{s G [0, oo) | X C S.^Jd^*)]}. 

Let S E C. The dimension and the strong dimension of S are respectively defined 

dim(5') = cdimd^"}), 
Dim(5) = cDim({5}). 

Intuitively, the (strong) dimension of S is the most unfair betting environment s in which 
the optimal constructive gambler d (strongly) succeeds on S. 

Observation 2.8. Let S E C If s > dim(iS') and s' > Dim(S'), then for infinitely many n, 
d{S \ n) > 2(i-^)"d(A), and for all but finitely many n, d{S \ n) > 2^^-'>d{X). 
Observation 2.9. If 5 G RAND, then dim(5) = Dim{S) = 1. 

The following theorem - the first part due to Mayordomo |May02| and the second to 
Athreya et al. |AHLMn6j - gives a useful characterization of the dimension of a sequence 



in terms of Kolmogorov complexity, and it justifies the intuition that dimension measures 
the density of computably enumerable information in a sequence. 

Theorem 2.10. |May02| IXHLM06] For all S eC, 

dim(iS') = liminf — — 1^ ^^^^ Dim(S') = limsup , ^ 



n n — ^oo ri 

One of the most important properties of constructive dimension is that of absolute 
stability, shown by Lutz |Lut03bj . which allows us to reason equivalently about the con- 
structive dimension of individual sequences and sets of sequences: 

Theorem 2.11. |Lutn3bj For all X CC, 

cdim(X) = sup dim(S'), and cDim(X) = sup Dim(S'). 

3 Previous Work 

The next theorem says that every sequence is Turing reducible to a random sequence. 
Part 1 is due independently to Kucera and Gacs, and part 2 is due to Gacs. 

Theorem 3.1. |Kuc85l IKuc89l K;ac86j There is an OTM M such that, for all S e C, 
there is a sequence R e RAND such that 

1. S <T R via M. 

2. pI,{S,R) = 1. 

Let X C C. Define the code cost of X by 



ct(X) = inf I sup pl, (5, Me (5)) 



(V5 G X) M,{M,{S)) = S 
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ct{X) is the optimal lower compression ratio achievable with reversible Turing reductions 
on sequences in X. The next theorem is due to Ryabko |Rya84t Rya86| . 

Theorem 3.2. |Rya86| For every X CC, ct{X) = cdim(X). 

Ryabko defined ct based on what he calls "T-codes" and did not explicitly mention 
OTMs, but these are essentially equivalent. A T-code is a pair of encoder /decoder (i.e. 
compressor /decompressor) algorithms E,D : {0,1}* {0,1}* - implemented by the 
Turing machines Mg and Md in the present paper's definition of ct - which are required 
to be monotonia: for all x,y & {0, 1}*, 



x^y ^ E{x) □ E{y) and D{x) □ D{y). 



Me and Md can be considered OTMs that always make queries to entire prefixes of the or- 
acle sequence, which is represented by the input string to the compression/decompression 
algorithm. The OTM's input n, which represents the size of the output prefix to compute, 
is then implicitly the number of bits output by Mg or M^. By restricting the behavior of 
an OTM in this way, the query usage necessarily counts all oracle bits to the left of any 
bit that gets queried, in addition to the queried bit. In other words, the query usage was 
implicitly defined by Ryabko to be the index of the rightmost queried bit, as opposed to 
the number of bits queried. All results of the present paper hold if query usage is instead 
defined in this manner. 

To define a lower compression ratio, instead of considering the liminf over all bit 

n— >oo 

positions n in S, which is how p~ is defined, Ryabko considered the lim sup over all block 

i— >oo 

positions (i.e. subsequences of bit positions), where < ni < n2 < < ■ ■ .. He then 
included the block positions as part of the specification of the T-code, by requiring the 
Turing machines to read their input and produce output in sequential blocks. Therefore 
the optimization over all pairs of encoding/decoding machines Me,Md in the current 
paper's definition of ct simultaneously optimizes over all subsequences of bit positions at 
which to measure the compression ratio. It is routine to verify that the infimum over all 
subsequences of bit positions {ni}°Zi of the lim sup over the positions {rij}^]^ is exactly 

the liminf over all bit positions n. 

n— >oo 

1 11 -: constructive dimension as defined by Lutz |Lut03bj had not yet been defined 



at the time Ryabko wrote |Rya86| . He in fact showed that, for all X C C, ct(X) 
sup liminf M^^. By Theorems 12 . 1 II and 12. 101 the right hand side is cdim(X). 

S€X "^o° " 

Theorem 13.21 achieves weaker compression results than the main results of this paper. 
Theorems 14.31 and 14.61 Theorem 13.21 does not include p'^ or cDim, and it requires opti- 
mizing over all OTMs. However, unlike Theorem 14. 3[ in which only the decompression is 
computable, the compression achieved in Theorem 13.21 is computable, by the definition of 
Ct- 



9 



4 Results 

We now establish the new results. 

The following lemma shows two senses in which the composition of two oracle Turing 
machines in a transitive Turing reduction bounds the compression ratio of the transitive 
reduction below the product of the compression ratios of the two original reductions. 

Lemma 4.1. Let S,P,R e C and M^, M| e OTM such that S <t: P via and 
P <T R via M^, and let M ^ o M^, so that S <t R via M. Then 

pt,{S,R)<p+^,{S,P)p+^^{P,R), 

and 

Pm{S,R) < p-^,{S,P)pl^{P,R). 

Proof. Let rg~^ > p'l^piS, P), r^"" > p~^p{S, P), and rp+ > P'Ijr{P, R)- It suffices to show 

that pIi{S, R) < r|'+r^+ and p^iS, R) < r^'r^^. 

For infinitely many n, #5 (M^, n) < r^'n. For all but finitely many n, ^g{Mg, n) < 
Tg^n, and :^p{Mp,n) < rp'^n. Then, for all but finitely many n, to compute S \ n, M 
requires 

#f (M, n) = #^ (M|, #f (Ml, n)) < r^+#f (mJ, n) < r^^r^^n 
queries to R. Since this holds for all but finitely many n, 

p+{S, R) = hmsup *s{M,n) ^ ^p+^^+_ 

n— >oo 1^ 

For infinitely many n, to compute S \ n, M requires 

#f(M,n) = #^ (M^,#f (M|',n)) < rf+#f (M|',n) < rf-rf+n 
queries to R. Since this holds for infinitely many n, 

p-^{S, R) = hminf < r^-r^+. 

n—*oo n 

□ 

An OTM that computes a sequence S, together with a finite number of oracle bits 
that it queries, is a program to produce a prefix of 5*. Thus, the query usage of the Turing 
machine on that prefix cannot be far below the Kolmogorov complexity of the prefix. 
This is formalized in the following lemma, which bounds the compression ratio below by 
dimension. 

Lemma 4.2. Let S,ReC and M e OTM such that S <t R via M. Then 

Pm{S, R) > dim(5), and p^^iS, R) > Dim(,5). 
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Proof. Let ttm be a self-delimiting program for M, so that, for all x G {0, 1}*, U{'Kmx) = 
M(x). Let r„ G {0, l}#s(^'"-) be the oracle bits of R queried by M on input n, in the 
order in which they are queried. Recall the self-delimiting encoding function enc. For 
each n E N, let Hn = 7rM'vrMenc(r;,)enc(r„), where ttm' is a self-delimiting program that 
simulates M, encoded by ttm, on input n, encoded by enc(n), with oracle R, encoded by 
enc(rri). When M makes its i^^ query to a bit of R, the bit r„[z] is returned. Since M 
queries each bit of R at most once (see section |21), the bit from r„ will be correct, no 
matter what index was queried by M, since the bits of r„ are arranged in the order in 
which M makes its queries. 

Then [/(7r„) = S \ n, so K{S \ n) < |7r„|. By Theorem imH 

dim(5') = liminf- ^ ^ 



n^oo fl 

7rAf'7rMenc(n)enc(r„) 



< lim inf 



< lim inf 



IvTM'VTAfl +logn + 21oglogn + #|(M, n) + 21og#|(M, n) + 6 



n 



liminfM^^ 



= pl.j{S,R), 

and similarly, Dim(5') < p'lj{S,R). □ 

The next theorem is the main result of this paper. It shows that the compression 
lower bounds of Lemma 14.21 are achievable, and that a single OTM M suffices to carry 
out the reduction, no matter which sequence S is being computed. Furthermore, the 
oracle sequence R to which 5" reduces can be made Martin-Lof random. The randomness 
of R is easily accomplished by invoking the construction of Gacs in a black-box fashion; 
the majority of the work in the proof is establishing the bound on the compression. 

Theorem 4.3. There is an OTM M such that, for all S E C, there is a sequence R G 
RAND such that 

1. S <T R via M. 

2. Pm{S,R) = dim(5). 

3. Pm{S,R) = Dim(5). 



Proof idea: If the dimension of S is small, then the optimal constructive martingale 
d performs well on S. Thus, if we have already computed a prefix 5" f n of S", then 
on average, d increases its capital more on the next k bits of 5* than it would on other 
k-hit strings that could extend S \ n. This places the next k bits of S" in a small (on 
average) subset of {0, l}'^, namely, those strings on which d increases its capital above 
a certain threshold which is chosen to be slightly smaller than d{S \ {n + k)), the 
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amount of capital made after the next k bits of S. Since d is constructive, it is possible to 
enumerate strings from this small set by evaluating the computable function d in parallel 
on all possible length- /c extensions of \ n, and outputting a string u G {0,1}'^ when 
d{{S \ n)u,t) is greater than dn, for some value of t G N. We will encode the next k 
bits of S as an index into this set, where the index will represent the order in which this 
parallel evaluation enumerates the string we want - the next k bits of S. This technique 
is similar to that used by Merkle and Mihailovic |MM04j to prove Theorem 13.11 



We require two lemmas to prove Theorem 14.31 Lemma 14.41 shows that the average 
number of bits needed to encode the index of a length-fc extension of 5* f n is close to 
the dimension of S times k. We will also need to encode the threshold dn into the oracle 
sequence, since the actual amount of capital that d will make is uncomputable. Lemma 
14.51 shows that we can find a rational threshold dn that requires so few bits to represent 
that it will not affect the compression ratio when added to the oracle sequence, yet which 
is still a close enough approximation to d{S \ {n + k)) to keep the index length of Lemma 
l44l small. 



Lemma 4.4. Let S E C. For all i G N, define ki = i + 1, and define uq = and 
rii = ni_i + ki = iiiiil for i > 0. Let do,di, . . . be a sequence of real numbers such that, 
for all ieN, di> d{S \ Ui) (l - ^) . Define Ai C {0, l}'^" by 

A, = {tiG{0,lf' I d{{S \ n,_,)u) > d,} . 

Then 

liminf — — — — < dim(S'), and limsup — — — — < Dim(S'). 

Proof. We show the result for dim(S'). The proof for Dim(iS') is similar, replacing "for 
infinitely many i" conditions with "for all but finitely many i." 

The indices uq < ni < n2 < . . . partition 5" into blocks S[nQ . .ni — 1], S[ni . .n2 — I], 
. . ., with ki = nj+i — Ui equal to the length of the i^^ block, and Ui equal to the length of 
the first i + 1 blocks. 

Let t' > t > dim(S'). It suffices to show that, for infinitely many z G N, Yl]=o ^^S l^jl < 
t'rti. Since t > dim(S'), for infinitely many G N, 

d{S \n) > 2(i-*)"d(A). 

A martingale can at most double its capital after every bit, and each index n with < 
n < is at most ki bits beyond n^. It follows that for infinitely many 2 G N, 

d(5 f rii) > 2(^-*)"»-'="d(A). (4.1) 

For all i G N, set /j G M such that d{S \ rn) = 2^'"''d(S' \ Ui^i). By induction on i, 

i 

d{S \m) = d{X)l[2''^-'^. (4.2) 

j=0 
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Then, by equations ()4.1|) and ()4.2p . and the fact that ^*=q A;^ = n^, for infinitely many 

i e N, 

i i i 

i=o i=o i=o 

Recall that d{S \ ni) (l — r^) < rfj. By Corollary 12.51 (take k = ki,l = li,a = 1 — j2,w = 
S \ r2,j_i) and the definition of /j, since 



d,>{l-^\d{S\n,)=(l-^] 2'^-''diS \ n. 



f^i ^7 . . . _ .... 



2'' / 1 \ 

it follows that |Aj| < and so log \Ai\ < U — log I 1 ~ ^ ) • Let co,i = log |y4o| + 

log I All - /o - ^1- Then 

Y.\og\A,\ < ^/^.-^log^l-l'j+Co,! 

j 

< tm + 2ki-y^ (log(j + 1) + log(j - 1) - 2 log j) +co,i 



7=2 ^ 

telescopes 



= t'm + {t- t')ni + 2A;, - (log 1 - log 2 - log z + log(z + 1)) + co,i. 

t < t', 2ki = o(nj), and lim [log(2 + 1) — logi] = 0. Therefore, for infinitely many i, 
E;=olog|A,| <t'n,. □ 



Lemma 4.5. Let i G Z+ c G M^, anc? r G 



l,c2* 



Then there is a rational number 



d G Q+ such that r >(i > r (l - ^) and K{d) = O(logz). 

Proof. We prove the cases r > i"^ and 1 < r < separately. Suppose r > i"^. In this 
case we will choose d to be an integer. Set k G Z+ such that 2^^^ < < 2^^. Since 
r > 2^~\ Rogr] > A; - 1. 

Let d G be the integer whose binary representation is xO^'°^^^~'^, where x G {0, l}'^ 
is the first k bits of [rj . Since d shares its first k bits with r, 

r - < 2ri°s'-l-'= - 1 < - 1< ^, 

so r > c? > r (l — . (i can be fully described by the first k bits of r, along with the 
binary representation of the number [logr] — A; of O's that follow. Thus, describing d 
requires no more than k + log([logr] — A;) < logi^ + 1 + log log c + logi^ = O(logz) bits. 

This will not work if r G Z+ and r's least significant [logr] — k bits are 0, which would 
result in d = r, rather than d < r. In this case, let 

d = r — 1 = bnum (rep2(bnum(x) — ^ 
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where bnum(x) is the integer whose binary representation is x, and rep2(n) is the binary 
representation (with possible leading zeroes) of n G N. This likewise requires O(logz) bits 
to describe. Since r > z^, d = r — 1 > r (l — . 

Now suppose that 1 < r < P. We approximate r by the binary integer \r\, plus a 
finite prefix of the bits to the right of r's decimal point in binary form. If x.S is the 
binary representation of r, where x G {0, 1}* and G C, let d G Z+ be represented by 
x.y, where y ^ S. 

Since r < i^, |x| < logi^ = O(logi). We need r — d< for d to approximate r closely. 
Since r — d < 2^1^', it suffices to choose y ^ S such that 2^1^' < ^, or \y I > log Let 



\y 



log- 



= O(logi), since r > 1. Thus |x| + \y\ = O(logi), so describing d requires 
O(logz) bits. 

This will not work if r is a dyadic rational x.z, where x, 2 G {0,1}* and \z\ < \y\, 
which would result in d = r, rather than d < r. In this case, let r' G [r (l — ) ? i^) be 
irrational. Choose d for r' by the method just described, such that r' > d > r' (^1 — 
and d requires 0(log(i-\/2)) = 0{logi) bits. Then d > r (l — by the triangle inequality, 
and d < r' < r. □ 

Finally, we prove Theorem 14.31 

Proof of Theorem \4 !^ If G RAND, then S <t S via the trivial "bit copier" machine 
M', with lower and upper compression ratio dim(5') = Dim(S') = 1, so assume that 
S ^ RAND. 

A single OTM M" suffices to carry out the reduction described below, no matter 
what sequence S ^ RAND is being computed. If S" G RAND, then M' is used. These two 
separate reductions are easily combined into one by reducing each sequence S" to a random 
sequence bR via M G OTM, where b e {0,1}, R = S if S e RAND, and R is given by 
the construction below if S* ^ RAND. The bit b indicates to M whether to use M' or M" 
for the reduction. Hence a single OTM M implements the "optimal decompression" . 

For all z G N, define fcj = z + 1, and define no = and rii = ni_i + ki = ^^^±1^ for 
i > 0. Note that Ui < for all z > 3. ki represents the length of the z*^ block into which 
we subdivide S. is the total length of the first i + 1 blocks. Define di G Q"*" to be a 
rational number satisfying 

1. d(S' \ Ui) (1 — < di < d{S \ Hi); i.e., di is a rational number approximating 
d(S' \ Hi) from below. 

2. K{di) = o{ki); i.e. di can be computed from a program asymptotically smaller than 
the length of the z*^ block. 

By ObservationEH d{S \ rii) < 2"'d(A) < 2^'d(A) for z > 3. By TheoremlZHl S ^ RAND 
implies that for all but finitely many z, d{S \ rii) > 1- Thus, by Lemma 14.51 (take 
r = d{S \ rii) and c = d(A)), there is a dj G Q"*" satisfying the above two conditions. 
Define the set A C {0, 1}''" for all z G N as in Lemma f4. 41 bv 

Ai = {ue{0,lY^ I d{{S \ n,_^)u) > d,] , 
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the set of all length-Zcj extensions of S \ that add more capital to the optimal 
constructive martingale d than S[ni-i . . — 1] does, to within multiplicative factor at 
most 1 — ^. Since d{S \ n-i) > di, it follows that S[ni^i . . rij — 1] G Aj. 

For all i eN, let jOj G N be the output of the following partial computable procedure, 
when given as input the string S[ni_i . . — 1] G {0, 1}^*: 

STRING-TO-lNDEX(S'[ni_i . . Hi - 1]) 

1 Ai^0 

2 for t = 0,1,2,... 

3 do for each u G {0, 1}^'> - A 

4 doifd{{S \ ni_i)u,t) >di 

5 then add u to 

6 if -u = S[ni_i . .Hi — 1] 

7 then output l^jl and halt 

In other words, Pi is the order in which d{S \ rii) is shown to exceed (i.e., to 
belong to Ai) by a parallel evaluation of d{{S \ ni_i)u,t) on all extensions u G {0, l}''' 
of S \ rii-i, for t = 0, 1,2, . . .. Since di < d{S \ rii), there exists some t G N such that 
d{S f ni,t) > di, and so Pi is well-defined. The computation of Index-To-String, the 
inverse of String-To-Index, resembles that of String-To-Index: 

iNDEX-TO-STRING(pi) 

1 Ai^ 

2 for t = 0,1, 2,... 



3 do for each u G {0, l}'^' - Ai 

4 doifd((S \ ni^i)u,t) > di 

5 then add u to Ai 

6 if \Ai\ = Pi 

7 then output u and halt 



Note that Index-To-String will not halt if given as input an integer greater than \Ai\, 
and String-To-Index will not halt if given a string that is not an element of Ai. 

For all i G N, let 7r((ij) denote a self-delimiting, shortest program for computing di. 
Define the sequence P G C by 

P = enc{po)n{do)enc{pi)n{di)enc{p2)n{d2) 

Define the oracle Turing machine Mg that produces n bits of S, with oracle P, as 
follows. Let i{n) denote the block in which n resides - the unique i E N such that 
Tii <n < Hj+i. First, Mg reads the first i{n) -\- 1 blocks of P: 

enc(po)7r(cio) • . . enc(pj(„))7r((ij(„)). 

Mg then calculates the first i{n) + l blocks of S iteratively. On block i, Mg first computes 
Pi from enc(pi) and di from 7r((ii). Then, Mg evaluates iNDEX-To-STRlNG(pi) to obtain 
S[ni_i . . rii] and outputs it as the i^^ block of S. 
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Since S[ni^i..ni — 1] G A, it follows that pi < \Ai\, and so |enc(pj)| < logjAI + 
2 log log \Ai \ + 3. Therefore, by Lemma f4.4[ 

hm inf ^^=°'''''^^^'^' < hm inf ^^=°^"^'^^'' < dim(5) , (4.3) 

and 

Yl]=o 1 6nc (pj ) I S 7=0 I I 
limsup — < limsup — < Dim(S'). (4.4) 

i — >oo i — ^oo 

By our choice of di, \n{di)\ = o{ki), so Yl]=o = o{ni), giving 

. ,E}=olenc(Pj)7r(d,)| . „ E}=o |enc(p,) | 
hmmf — ^ = hmmt — , (4.5) 

and 

E;=olenc(pj)vr(dj)| E5=o |enc(pj)| 
limsup — = limsup — . (4-o) 

i— >oo TT'i i—*oo f^i 

By the definition of liminf, 

Since rij = M^Hl^ z^;^ = o{ni), so 

E5=i|enc(pj)7r((ij)| E}=o |enc(Pi)7r(rfj)| 
limsup — < limsup — . (4.8) 

n— >oo IT' j— >oo 

In other words, because the block size grows slower than the prefix length, the lim sup over 
all blocks is at least the lim sup over all bits (and they are in fact equal by the definition 
of limsup). Regardless of the block growth rate, this inequality holds trivially for liminf. 

For all 77, G N, Mg requires Y2f=o \^^'^{Pj)^{dj)\ bits of P in order to compute n bits 
of 5", and hence, by inequalities ()4.3|) - ()4.8p . 

Pj...p{b, r) = hmmt — < dim(ii), 

and 

pT.p [S,F) = hm sup — < Dim(6 ) . 

^ 71— >00 

Let R G RAND and Mp G OTM be given by the construction of Gacs in his proof of 
Theorem O satisfying P <t R via Mj? and p+ ^(-P, R) = 1. Let M" = Mf o M^. Then 
S <T R via M" and, by Lemma 14. 

pl,„{S, R) < pl^,{S, P)pl,n{P, R) < dim(5), 
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and 



pts^S, R) < pt,p{S, P)pl,n {P, R) < Dim(5). 



By Lemma Hill p;^^„{S,R) > dim(5') and p^^„(S', i?) > 0^(5). 



□ 



Finally, these results give a new characterization of constructive dimension. 



Theorem 4.6. For every sequence S G C 



dim(5) =p-{S) 



and Dim(S') 



pHs) 



and, for all X C C 



cdim(X) = sup p (S) 

Sex 



and cDim(X) 



sup p^{S). 



Proof. Immediate from Lemma f4. 21 and Theorems 14.31 and 12.1 11 



□ 



It is instructive to compare Theorem 14.61 with Ryabko's Theorem 13.21 considering es- 
pecially what they say about individual sequences. While Ryabko's theorem represents 
S with a more compact sequence R, it is not optimally compact, as a different decoding 
machine is required to get the compression ratio closer and closer to the optimal ratio 
of dim(5'). However, the major difference between the theorems is that Ryabko's con- 
struction does not achieve the bound between p"*" and Dim. Intuitively, Ryabko's theorem 
states that S may be compressed to a sequence R, where infinitely often (but not almost 
everywhere), approximately the first K{S \ n) bits of R suffice to produce S \ n. However, 
Ryabko's construction requires that the block lengths grow exponentially, so that if S is 
written X1X2X3 . . ., then for all z G N, \xi . . .Xi\ < 2~'|xi . . Therefore, while the 

lower compression ratio p~ is close to optimal, the upper compression ratio p"*" goes to 
infinity. 

5 Conclusion 

We have shown that every infinite sequence is Turing reducible to a Martin-Lof random in- 
finite sequence with the optimal compression ratio possible. Since this optimal ratio is the 
constructive dimension of the sequence, this gives a new characterization of constructive 
dimension in terms of Turing reduction compression ratios. 

The Turing reductions of Theorems 13. H 13.21 and 14.31 satisfy the stronger properties 
of the weak truth-table reduction (see jSoaSTj ). which is a Turing reduction in which the 
query usage of the reduction machine M on input n is bounded by a computable function 
of n. For example, l.Oln + 0(1) suffices. Thus, constructive dimension could also be 
defined in terms of weak truth-table reductions. 

As noted in the introduction, for the sequences S and R in Theorems 13.11 and 14.31 it 
is not necessarily the case that R <t S. In other words, though the decompression is 
computable, it is not computably reversible in all cases. For instance, if S is computable, 
then R S, since no sequence R e RAND is computable. For this reason, Theorem 14.31 



17 



does not imply Theorem 13.21 which allows for the reduction to be computably reversed, 
subject to the trade-off that the compression requirements are weakened. It remains open 
whether the compression direction is computable if we drop the requirement that the 
sequence R be random. 
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